A QAP Solver with CUDA GPU Computing Architecture A Two Page Description of the Application Submitted for GECCO 2009 Competition : GPUs for Genetic and Evolutionary Computation

نویسندگان

  • Noriyuki Fujimoto
  • Shigeyoshi Tsutsui
چکیده

This application solves the quadratic assignment problem (QAP) [1]. In QAP, we are given l locations and l facilities and the task is to assign the facilities to the locations to minimize the cost. We chose QAP for the following reasons: First, problem sizes of QAPs in real life problems are relatively small compared with other problems in permutation domains such as the traveling salesman problem (TSP) and the scheduling problem. This enables us to use the shared memory of a GPU e ectively. Second, QAP is one of the most di cult problems among problems in permutation domains. Thus, QAP is a good test bed to evaluate an optimization algorithm. As the parallel method, a multiple-population, coarsegrained GA model was adopted. Each subpopulation is evolved by a multiprocessor in a CUDA GPU. At predetermined intervals of generations all individuals in subpopulations are shu ed via the VRAM of the GPU. Applying local search in solving QAP is very common in evolutionary algorithms [2, 3, 4]. Popular local searches used in solving QAP are the taboo search (TS) and 2-OPT heuristic. However, to apply these local searches e ciently, we need a large memory for each parallel thread. For example, the size needed for taboo search [5] is 32l + l bytes. To take this size space for each thread in the shared memory with current GPU is almost impossible. Thus, we will not apply any local search for GPU. We applied local searches (2 OPT Best Move and 2 OPT First Move) for CPU. The instances on which this algorithm was tested were taken from the QAPLIB benchmark library at [6]. Our results were promising, showing the performance of GPU programs without local searches is comparative to or even better than the performance of CPU programs with local searches on the Intel R ⃝ Core i7 965 processor, which is one of the fastest processors available currently.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accelerating high-order WENO schemes using two heterogeneous GPUs

A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...

متن کامل

An approach to Improve Particle Swarm Optimization Algorithm Using CUDA

The time consumption in solving computationally heavy problems has always been a concern for computer programmers. Due to simplicity of its implementation, the PSO (Particle Swarm Optimization) is a suitable meta-heuristic algorithm for solving computationally heavy problems. However, despite the simplicity, the algorithm is inefficient for solving real computationally heavy problems but the pr...

متن کامل

Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems

Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of schedulation of hardware resources regarding the concurrency of threads. In this paper, for resolving the problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...

متن کامل

Numerical Simulation of a Lead-Acid Battery Discharge Process using a Developed Framework on Graphic Processing Units

In the present work, a framework is developed for implementation of finite difference schemes on Graphic Processing Units (GPU). The framework is developed using the CUDA language and C++ template meta-programming techniques. The framework is also applicable for other numerical methods which can be represented similar to finite difference schemes such as finite volume methods on structured grid...

متن کامل

MPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling

Among different discretization approaches, Finite Difference Method (FDM) is widely used for acoustic and elastic full-wave form modeling. An inevitable deficit of the technique, however, is its sever requirement to computational resources. A promising solution is parallelization, where the problem is broken into several segments, and the calculations are distributed over different processors. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009